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Abstract. In this article we consider a priori error and pointwise estimates for finite 
element approximations of solutions to semilinear elliptic boundary value problems in 
d ^ 2 space dimensions, with nonlinearities satisfying critical growth conditions. It 
is well-understood how mesh geometry impacts finite element interpolant quality, and 
leads to the reasonable notion of shape regular simplex meshes. It is also well-known 
how to perform both mesh generation and simplex subdivision, in arbitrary space dimen- 
sion, so as to guarantee the entire hierarchy of nested simplex meshes produced through 
subdivision continue to satisfy shape regularity. However, much more restrictive angle 
conditions are needed for basic a priori quasi-optimal error estimates, as well as for a 
priori pointwise estimates. These angle conditions, which are particularly difficult to 
satisfy in three dimensions in any type of unstructured or adaptive setting, are needed 
to gain pointwise control of the nonlinearity through discrete maximum principles. This 
represents a major gap in finite element approximation theory for nonlinear problems on 
unstructured meshes, and in particular for adaptive methods. In this article, we close this 
gap in the case of semilinear problems with critical or sub-critical nonlinear growth, by 
deriving a priori estimates directly, without requiring the discrete maximum principle, 
and hence eliminating the need for restrictive angle conditions. Our main result is a type 
of local Lipschitz property that relies only on the continuous maximum principle, to- 
gether with the growth condition. We also show that under some additional smoothness 
assumptions, the a priori error estimate itself is enough to give L°° control the discrete 
solution, without the need for restrictive angle conditions. Numerical experiments con- 
firm our theoretical conclusions. 
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1. Introduction 



In this article we consider a priori error estimates and discrete pointwise estimates for 
Galerkin finite element approximation of solutions to a general class of semilinear prob- 
lems satisfying certain growth conditions in d space dimensions, which includes prob- 
lems with critical and subcritical polynomial nonlinearity. When d = 2 and d = 3, it is 
well-understood how mesh geometry impacts finite element interpolant quality (cf. [3]). 
Such considerations lead to the requirement that simplex meshes used for finite element 
approximation satisfy a reasonable mesh condition known as shape regularity. It is well- 
known how to perform both mesh generation and simplex subdivision, in arbitrary space 
dimension, so as to guarantee that the entire hierarchy of nested simplex meshes pro- 
duced through subdivision satisfy shape regularity, and continue to do so asymptotically 
(cf. [1, 4, 5, 29]). However, much more restrictive angle conditions are needed for ba- 
sic a priori quasi-optimal error estimates, as well as for a priori pointwise estimates for 
Galerkin finite element approximations. These angle conditions, which are particularly 
difficult to satisfy in three dimensions in any type of unstructured or adaptive setting, are 
needed to gain pointwise control of the nonlinearity through discrete maximum princi- 
ples. This represents a major gap in finite element approximation theory for nonlinear 
problems on unstructured meshes, and in particular for adaptive methods. In this ar- 
ticle, we close this gap in the case of semilinear problems with critical or sub-critical 
nonlinear growth, by deriving a priori estimates directly (both error estimates and dis- 
crete pointwise estimates), without requiring the discrete maximum principle, and hence 
eliminating the need for restrictive angle conditions. 

Critical exponent problems arise in a fundamental way throughout geometric analysis 
and general relativity. One of the seminal critical exponent problems in nonlinear PDE 
is the Yamabe Problem [2]: Find u G X (for some appropriate space X) such that 



where Q is a Riemannian 3-manifold, g is the positive definite metric on Q, A g is the 
Laplace-Beltrami operator generated by g, R is the scalar curvature of g, and R u is the 
scalar curvature corresponding to the conformally transformed metric: g = <p A g. The 
coefficients R and R u can take any sign. The Banach space X containing the solution 
is an appropriate Sobolev class W s,p (Vt) for suitably chosen exponents s and p. If the 
manifold has a boundary, then boundary conditions are also prescribed, such as u = 1 
on an exterior boundary to f2. In the case that fid 3 , and g^ = 8^, then A g reduces 
to just the Laplace operator on f2. This problem is full of features that are challenging 
both mathematically and numerically, including: critical exponent nonlinearity, poten- 
tially non-monotone nonlinearity, spatial dimension d ^ 3, and spatial domains that are 
typically non-flat Riemannian manifolds rather than simply open sets in R. d . A related 
critical exponent problem, containing all of the difficulties of (1.1)-(1.2), plus the addi- 
tion of low-order non-polynomial rational nonlinearities, arises in mathematical general 
relativity in the form of the Hamiltonian constraint equation; cf. [21]. 

The presence of the term u 5 term in (1.1), and in the related Hamiltonian constraint 
in general relativity, is an example of a critical exponent problem in space dimension 
three; such problems are known to be difficult to analyze due to the loss of compact- 
ness of the embedding H 1 C L p+1 , where the dimension-dependent critical exponent 
p = (d + 2)/(g? — 2) takes value p = 5 when d = 3. Loss of compactness of the em- 
bedding creates obstacles that prevent the use of compactness arguments in standard 



-8A g u + Ru = R u u 5 
u > 0, 



in f2 



(1.1) 
(1.2) 



FE ERROR ESTIMATES FOR CRITICAL GROWTH SEMILINEAR PROBLEMS 3 

variational, Galerkin, and fixed-point techniques. As a result, these techniques are gen- 
erally restricted to subcritical nonlinearities, unless additional techniques give control of 
the nonlinearity, such as a priori L°°, or pointwise, control of solutions. The inequality 
constraint (1.2) creates additional complexities in both theory and numerical treatment 
of such problems, with only positive solutions having physical meaning. Prior work on 
numerical methods for critical exponent semilinear problems has focused primarily on 
the development of adaptive methods for recovering solution blowup; cf. [8, 7]. 

The standard approach to obtaining a priori L°° bounds on Galerkin approximations 
is to enforce approximation space properties to guarantee discrete maximum principles, 
leading to geometrical conditions on the underlying simplex mesh. In the case of the 
d = 2 Poisson problem, discrete maximum principles can be established if all angles in 
the triangulation are non-obtuse (cf. [12]). This was relaxed to "summation of two oppo- 
site angles less or equal to ix" in [42, Page 78] (the so-called nonnegative triangulation 
in [17]). In some cases discrete maximum principles hold more generally [33]. How- 
ever, counter-examples indicate angle conditions cannot be relaxed as sufficient condi- 
tions [17]. For variable coefficients, anisotropic versions of non-obtuse angle conditions 
are required for discrete maximum principles (cf. [24]). The same angle conditions are 
needed in the nonlinear case [26, 25, 44, 9]. Due to the central role angle conditions play, 
there is a growing literature on generating non-obtuse meshes [28]. Other approaches for 
obtaining L°° estimates using local analysis include [36, 34]; related work on a priori L°° 
error estimates include [6, 38]. In [15], quasi-optimal W l,co error estimates are estab- 
lished under "large-patch" local quasi-uniformity conditions. The proofs of the discrete 
maximum principle and the L°° error estimates in the aforementioned works are quite 
technical, and have been limited to linear finite elements. We note that [15] has an ex- 
tensive overview of L°° error estimates, and also relevant is [14] on localized pointwise 
(and negative norm) estimates for more general quasilinear problems. Finally, we note 
other relevant semilinear work includes [9, 20, 46, 47, 48]. 

The need for angle conditions to gain pointwise control of the nonlinearity through 
discrete maximum principles represents a major gap in finite element approximation 
theory for nonlinear problems on unstructured meshes, and is a particularly disturbing 
problem in the case of adaptive methods that guarantee only shape regularity of meshes 
produced through subdivisions. In this article, we close this gap in the case of semilinear 
problems with critical or sub-critical nonlinear growth, by deriving a priori estimates 
directly (both error estimates and discrete pointwise estimates), without requiring the 
discrete maximum principle, and hence eliminating the need for restrictive angle condi- 
tions. Our main result is proving a type of local Lipschitz property for Galerkin finite 
element approximations for solutions to problems with nonlinearities having critical and 
subcritical growth bounds, using only a priori L°° control of the continuous solution, 
together with other results that are independent of the approximation space. This result 
allows us to then establish, in successive order, quasi-optimal a priori energy error es- 
timates for Galerkin approximations, L 2 estimates via duality arguments, L°° estimates 
via inverse-type inequalities, giving finally a discrete L°° bound without a discrete maxi- 
mum principle, and therefore without requiring angle conditions beyond shape regularity. 
Although the techniques we use here are completely different, our results on obtaining 
a priori estimates without angle conditions can be viewed as complementing the 2006 
work of Nochetto, Schmidt, Siebert, and Veeser [30] on a posteriori estimates without 
angle conditions, for a similar class of monotone semilinear problems. However, while 
some of our results require monotone nonlinearity, several results are established under 
weaker conditions (see Assumptions (A3') in Section 2). 
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Outline of the paper. The remainder of the paper is structured as follows. In Section 2 
we describe a general class of semilinear problems, and under various assumptions de- 
rive a priori L°° bounds for solutions using cutoff functions and the De Giorgi iterative 
method (or Stampacchia truncation method). In Section 3, we develop quasi-optimal a 
priori error estimates for Galerkin approximations, where the nonlinearity is controlled 
only using a type of local Lipschitz property. While the Lipschitz property is usually 
proved using discrete maximum principles and L°° control of the discrete solution, we 
establish this result for nonlinearities having critical and subcritical growth bounds using 
only a priori L°° control of the continuous solution, together with other results that are 
independent of the approximation space. In Section 4 we then use standard duality argu- 
ments to obtain corresponding L 2 error estimates. Using inverse-type inequalities in the 
finite element approximation space, we then show that the discrete solution indeed has 
a uniform a priori L°° bound, without having access to the discrete maximum principle, 
and therefore without requiring restrictive angle conditions on the underlying finite el- 
ement mesh. Finally, in Section 5 we examine the predictions made by the theoretical 
results through a sequence of numerical experiments. 

2. Semilinear Problems and A Priori L°° Estimates 

In this section, we give an overview of a class of nonlinear elliptic boundary value 
problems on a bounded Lipschitz domain ft C R d with d = 2 or d = 3. To begin 
with, we introduce some standard notation. Given any subset G C R d , we use standard 
notation for the L P (G) spaces for 1 ^ p ^ oo, with the norm || • \\q >P) g- We use standard 
notation for Sobolev norms \\v ||fc,2,G — IMI.h*(g) f° r me Sobolev space H k (G). For any 
function v E L P (G) and w E L q (G) with p, q ^ 1 and 1/p + 1/q = 1, we denote 
the pairing (v,w)q := J G vwdx. For simplicity, when G = ft, we omit if from the 
norms (or pairing). Given a function g defined on T = <9ft, we define the affine space 
of as Hg(fl) := {v E H 1 ^) : v\r — g}. In particular, we have the following 

Poincare-Sobolev inequality 

H|o, P ^C s (p)||Vw||o, 2 , VaG^fO), (2.1) 

where p < oo if d = 2 and p — 2d/(d — 2) if d ^ 3, and the constant C s (p) depends 
only on p and f2. In the sequel, we simply denote C s := C s (2) when p = 2 in (2.1). 
We consider the following semilinear elliptic equation: 

- V • (DVu) + b(x, u) = f{x) in ft, u\ r = g, (2.2) 

with the following assumptions: 
(Al) The diffusion tensor D : R d -»■ R dxd E L°°(ft) satisfies that 

m|e| 2 ^ fD£ <: M|e| 2 , V£ E R d , 

for some constant m,M > 0. 
(A2) f(x) E L 2 (VL) and g E L°°(T). 

(A3) 6:ftx]R— >• R is a Caratheodory function, i.e., for any given (6R the function 
&(■,£) : ft — > K is measurable on ft, and for any given x E ft the function 
b(x, ■) : K — > R is smooth (cf. [18, Definition 12.2]). In the sequel, we will 
simply write b(u) instead of b{x, u), and assume that b is monotone: 

b'(0 >0 7 E R. (2.3) 
Without loss of generality, we also assume that 6(0) = 0. 
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(A4) b satisfies the growth condition: there exists an integer n with 1 ^ n ^ ^| if 
d ^ 3 and l^n<ooif<i = 2 such that 

\b^(0\<K, V(GK, (2.4) 

for some constant K > 0. 
The weak form of (2.2) reads: Find u G Hg(Q) such that 

a(u,v) + (b(u),v) = (f,v), \fv e H*{Q), (2.5) 

where a(u, v) := f n DVu ■ Vvdx. 

Before moving on, we make some brief comments about Assumptions (A1)-(A4). 
Assumption (Al) on the coefficient D implies that the bilinear form a(-, ■) is coercive 
and continuous, namely, 

H|Vv||o, 2 ^ a(v,v) mda(v,w) ^ M|| Vf||o )2 || Vw|| ,2, Vv,w G H^(fl). (2.6) 

This implies that the induced energy norm |||t>||| = \/a(v, v) is equivalent to the H 1 
semi-norm. The Assumption (A3) implies that 

(b(v) -b(w),v-w) ^0, Vf , w G Hq(Q). (2.7) 

While a number of our results rely in the monotonicity Assumption (A3), we establish 
several key results under a weak condition (see Assumption (A3') below) that allows 
for non-monotone nonlinearities. Finally, note that Assumption (A4) holds when b is a 
polynomial with degree up to (including) critical exponents. This assumption includes 
as examples the Yamabe problem, as well certain special cases of the Hamiltonian con- 
straint in the Einstein equations mentioned in the introduction. 

In the remaining of this section, we try to establish a priori L°° bounds on the solu- 
tion to (2.5) through maximum/minimum principles, which is quite standard in the PDE 
analysis (see for example [19, 43]). Since it is important for our subsequent analysis, we 
include a proof of a priori L°° bounds on weak solutions using the de Giorgi iterative 
method (cf. [13, 41]), which relies on the following lemma. 

Lemma 2.1. Let be a non-negative and non-increasing function on [s , oo) satisfy- 

for some constant A > 0, a > and (3 > 1. Then 

For a proof of this lemma, we refer to [45, Lemma 4.1.1] or [10, Lemma 12.5]. By 
using this lemma, we are able to give explicit a priori L°° bound of the solution to (2.5). 

Theorem 2.2. Let the Assumptions (A1)-(A3) hold, and u G HgiVt) be a weak solution 
to (2.5). Then 

u ^ u(x) ^ u, a.e. x G Q, (2.8) 

where u and u are defined as 

u = min{0,essinf xean 5((x)} - C||/|| ,2, (2.9) 
u = max{0,esssup 1 . ean 5f(x)} + C||/|| ,2- (2.10) 

C 2 (v) p~ 4 p~ 2 

Here the constant C = 3 w \Q\ 2 ^ 2p- 4 , where we choose p > 4 (when d = 2) or p = 6 
(when d = 3), and C s (p) is the Poincare-Sobolev constant in (2.1). 
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Proof. To prove the upper bound of (2.8), let s = max{0, ess sup^^ g(x)} and define 
a test function 

<p + (x) = (u(x) — r) + := max{«(i) — r, 0} 

with r ^ so ^ 0. Let G(r) := {x E £1 : u(x) > r}. By the choice of r, it is obvious that 
4> + E Hq(Q), and it satisfies 

V0 + (x) = V«(i), for a.e. x E G(r). 

By coercivity of a (•,•), we have 

m / |V0 + | 2 rfx < a(0 + ,0 + ) = a(u,(f) + ) 
Jn 

= (/-%),0 + )^(/,0 + ), 

where in the last step we have used the assumption (2.3) in (A3). Hence, 



|V0 + || 2 2 < - f \f<j> + \dx. 



By Sobolev embedding theorem, ||0 + ||o, P ^ C s (p) || V0 + ||o,2 forp E (2, oo) when d = 2, 
or p = 6 when d = 3, we obtain 



m Jn 



where the Poincare-Sobolev constant C s (p) depends only on the dimension d and f2. 
Notice that (p + (x) = when x E £1 \ G(r), by Holder inequality we obtain 

H^lloW) < ^ / \f<f> + \dx < ^IGWI^-II^IIo^wII/IIoaow, 

m JG{r) 171 

This implies 

ll^ + llo lft o(r)<-^|G(r)|*-5||/||o |a . (2-11) 
m 

Note that when s > r, we have G(s) C G(r) and + ^ s — r on Therefore, 

H + h P ,G { r) > II0 + IIo, p ,g(,) >{s- r)|G(s)| 1/p . (2.12) 
Combining the inequalities (2.1 1) and (2.12), we obtain 

ICuJgffik)' ' | G[r) |i- 

Now, by letting ip(s) = \G(s)\, a = p, (3 = p/2 - 1 > 1 and A = C%(p)\\f\\o t2 /m in 
Lemma 2.1, we obtain 



C?(p)||/|| ,2 o E= 



p — 4 



\G(s)\=0, VOso+ 8K "" J " u '' 2t=*\G(so)\* 

m 

By definition of G(s), this means that 

C 2 (n) P-4 p-2 

u ^ sq -\- — f2 ~2p~2,p~^ II /II o 2 a.e. in ft, 
m 

which proves the upper bound. 

The proof of lower bound of (2.8) is similar. We define s : = max{0, —ess inf^n g(x)}, 
and define the test function E Hq(Q) as 

<p~(x) := (m(x) + r)~ = min{-u(x) + r, 0} ^ 0, 
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for some r ^ s ^ 0. Similarly, we introduce the subset G(r) := {x G Q : u(a;) < — r}. 
Then by monotonicity (2.3) in (A3), one get that 



m / \V4>-\ 2 dx ^ a(u,4>-) = (f - b(u),4>-) <: (f,4>-). 



|v0 12 

'n 

The remaining of the proof are identical as the proof of upper bound, we omit the details 
here. This completes the proof. □ 

Remark 2.3. There are obviously other methods for showing Theorem 2.2. Clearly, one 
of the main benefits of using the de Giorgi iterative Lemma 2.1 is that it gives explicit 
bounds in the estimates. Notice that in Theorem 2.2 we only use the conditions (Al )-(A3), 
so this theorem can be applied to a large class of nonlinear PDE problems, including the 
super-critical ones as for the regularized nonlinear Poisson-Boltzmann equation (cf. [9, 
20]). In the case of the Hamiltonian constraint application, we would have b(u) = 
aRii + a T u 5 — a p u~ 3 — a w u~ 7 . This definition ofb satisfies the condition if we assume 
that a T ^ 0, a p ^ and a w ^ 0. 

Finally, we note that the de Giorgi iterative argument can also applied to establish 
discrete maximum/minimum principles, which give rise to discrete a priori L°° bounds 
for the discrete solution; see [25, 44] for more detail. However, in the discrete setting, 
it requires certain angle conditions in the underlying mesh in order to guarantee that 
the stiffness matrix is an M-matrix; this is what we wish to avoid in this paper, and will 
therefore take another approach in the following sections. 

We should also remark that the monotonicity assumption (2.3) is not essential for the 
maximum/minimum principles. In the remaining of this section, we should give another 
simple approach to show the L°° bounds of the continuous solution with a slightly more 
general assumption on b. The following assumption on the nonlinearity allows for a class 
of functions containing both monotone and non-monotone cases: 

(A3') 6:f2xIR— R is a Caratheodory function, which is barrier monotone in its 
second argument: there exist constants a, f3 G R, with a ^ (3, such that 

b(x, s) - f(x) ^0, Vs ^ 0, a.e. in Q. 

b(x, s) — f(x) ^0, Vs ^ a, a.e. in Q. 

We have the following theorem based on the Assumptions (Al), A(2) and (A3'): 

Theorem 2.4 (A Priori L°° Bounds). Let the Assumptions (Al)-A(2) and (A3') hold. Let 
u G H^iVt) be any weak solution to (2.5). Then 

u^u^u, a.e. in Q, (2.13) 

for the constants u and u defined by 



u := max < /?, sup g(x) > , u:= min < a, inf g(x) \ , (2.14) 

where a ^ /3 are the constants in Assumption (A3'). 
Proof. To prove the upper bound, let us introduce 

= (u — u) + = max{w — u, 0}. 

By the definition of u, it follows (cf. [40, Theorem 10.3.8]) that G Hi (Q) and ^ 
a.e. in Vt. Taking v = <p in (2.5), we have 

a(u, 0) = a(u — u,4>) = a(4>, <p) ^ m||V0||o 2 - 
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This implies that 

H V0|lo,2 < <u, <P) = (/ " < 0, 

since / — < a.e. in the support of <p, that is, ||V0||o,2 = which yields = 0. 
Therefore, the upper bound of (2.13) holds. 
Similarly, we introduce 

if) = (u — u)~ = min{w — u, 0}. 

It is obvious that ip G Hq(Q) can be used as a test function in (2.5). Moreover, ip ^ a. 
e. in f2, and Assumption (A3') implies / — b(u) ^ on the support of ip. Therefore, 

m\\ViP\\l 2 ^a(u,iP) = (f-b(u),iP)^0, 

which implies ip = as before. This proves the lower bound of (2.13). □ 

The same technique in Theorem 2.4 can be applied in the discrete setting, again with 
additional assumption on the mesh. In fact, if the triangulation satisfies that 

a(0i,0j)^O, Vi^J 

where (pi and <pj are the basis functions corresponding to the vertices i and j respec- 
tively, then the conclusion of Theorem 2.4 still holds for the finite element solution Uh- 
However, this is out of the scope of this paper. For more details, we refer to [23]. 

3. Quasi-optimal Estimates without Angle Conditions 

In this section, we consider the finite element approximation of (2.5) and derive quasi- 
optimal error estimates without angle conditions of any type. Without loss of generality, 
we assume that the Dirichlet data satisfies g = for ease of exposition. Let Th be a 
quasi-uniform triangulation of VL. We emphasize that the triangulation does not require 
any particular angle conditions other than the quasi-uniformity. Let Vh = {v G Hq(Q) : 
v\t G Pfc(T), VT G Th} be the standard finite element space defined on Th, where 
Pfc(T) (k ^ 1) is the space of polynomials of degree ^ k define on T. The finite element 
discretization of (2.5) reads: Find u h G Vh such that 

a(u h ,v) + (b(u h ),v) = (f,v), \/v G V h . (3.1) 

We remark that here we do not require the finite element space to be piecewise linear, 
which is a requirement in most literature for discrete maximum/minimum principles. 

We now give a simple lemma that establishes a a priori energy bounds on solutions 
to (2.5) and (3.1) that are independent of most features of the problem; these bounds will 
be critical for proving quasi-optimal error estimates without mesh conditions. 

Lemma 3.1. Let the Assumptions (Al )-(A2) hold, andu G H^(Q) anduh G Vh C 

be the weak solutions to (2.5) and (3.1), respectively. If the nonlinear function b satisfies 

(A3) then the following a priori energy bounds hold: 

||Vit|| 0l 2<— imio,2, (3.2) 
m 

||V^||o )2 <^||/||o )2 , (3-3) 
where C s is the Poincare-Sobolev constant in (2.1). 
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Proof. If b satisfies Assumption (A3), then (b(u),u) ^ by (2.3), which implies that 

|||u||| 2 = a(u,u) = (/- b(u),u) < ||/||o,2|M|o,2- 
Therefore, the coercivity (2.6) of a(-, •) and Poincare inequality (2.1) imply that 

iivwiio, 2 <— imio, 2 , 

m 

which shows the inequality (3.2). The inequality (3.3) follows by the same arguments. 

□ 

With a certain convenient assumption (3.4) on the nonlinear function b that we will 
examine in more detail shortly, we can easily obtain the following quasi-optimal a priori 
error estimate for Galerkin approximations. 

Theorem 3.2. Let the Assumptions (Al )-(A3 ) hold, and u and Uh be the solutions to (2.5) 
and (3.1) respectively. Assume that there exists a constant C l such that 

(b(u) - b(u h ),v) ^ C L \\V(u - OUo,2||V<;||o,2, Vi; e H^Q). (3.4) 
Then we have the following quasi-optimal error estimate: 

l\u — Uh\\\ ^ f 1 H | inf \\\u - v\\\. 

\ m J vev h 

Proof. By subtracting equation (3.1) from (2.5), we have 

a(u - u h ,v) + (b(u) - b(u h ),v) = 0, Vt> G V h . 
By using this identity, we obtain 

\\\u - u h \\\ 2 = a(u — u h ,u- u h ) 

= a(u — Uh,u — v) + a(u - u h , v - u h ), Vv e V h 
= a(u -u h ,u-v) + (b(u) - b(u h ),u h - v) 

= a(u -Uh,u-v) + (b(u) - b(u h ),u - v) - (b(u) - b(u h ),u - u h ), 

^ — W/i||||||w — v|| + (b(u) — b(uh),u — v), 

where we used the monotonicity (2.3) of b in Assumption (A3). 
Now by assumption (3.4) and coercivity (2.6), we have 

Cl 

(b(u) - b(u h ),u-v) < Cx||V(u-Ufc)||o l2 ||V(ii-i;)||o,2 < — ||w - u h \l\\\u - v\\. 

m 

The conclusion then follows since v & Vh is arbitrary. □ 

Remark 3.3. We note that the monotonicity assumption (2.3) in Assumption (A3) as 
used in Theorem 3.2 may be weakened in several different ways. Such a generalization 
of Theorem 3.2, using an argument based on L 2 -lifting, can be found in [22]. 

Theorem 3.2 provides a general framework for establishing quasi-optimal a priori 
error estimates for (2.2). The key is to realize the assumption (3.4), which is a relaxation 
of the Lipschitz continuity of b. For nonlinearities that are not Lipschitz continuous, 
a standard approach to deriving inequality (3.4) is to use continuous and discrete L°° 
bounds on u and uu, as was done in [9, 20] for the Poisson-Boltzmann equation. In this 
approach, since ||m||o,oo ^ M% and ||«/i||o )0 o ^ M 2 , one can control the nonlinear term as 

b(u) - b(u h ) = b'(u h + t(u - u h ))(u - u h ) 

< \\b'(u h + t(u - u h ))\\ 0)OO \\u - U h \\ ,2, 
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for some t G [0, 1]. In this way, one can easily obtain (3.4). Unfortunately, this approach 
requires a priori L°° bounds on Uh- The standard approach for obtaining such a priori 
L°° bounds on Uh is by discrete maximum principles, which requires restrictive angle 
conditions. These angle conditions are particularly difficult to satisfy in the unstructured 
and adaptive settings, especially in three space dimensions. 

However, with the help of the growth condition (A4), it is actually possible to establish 
assumption (3.4) without employing discrete L°° bounds on Uh, and hence without any 
assumptions on the mesh at all. The remainder of this section is devoted to proving this. 
We first note that Lemma 3.1 gives both continuous and discrete a priori energy bounds 
that depend only on ||/||o,2> an d on the coercivity constant m and the Poincare-Sobolev 
constant C s . In particular, there is no dependence on the discretization parameter h in the 
case of the bound for Uh- Hence, combining (3.2) and (3.3) with the triangle inequality 
and coercivity of a (•,•), we obtain 

2C 

||V(u-Ollo,2<— 11/110,2=:^, (3.5) 
m 

where R = R(C s ,m, ||/||o,2) is a constant independent of h. This observation makes 
possible the following local Lipschitz result. 

Theorem 3.4. Let the Assumptions (A1)-(A4) hold, and let u and Uh be the solutions 
to (2.5) and (3.1), respectively. Then 

(b(u)-b(u h ),v) < C L ||V(w-^)||o,2||V^|| 0) 2, Vv e iJ^fi), (3.6) 

where Cl = Cl{Q, f, IMIoo, d, n, m) is independent of h. 

Proof. We begin with the Holder inequality 

(b(u) - b(u h ),v) ^ \\b(u) - &K)ll 0lP IMIo, P *, (3-7) 

where 1/p + 1/p* = 1. For d ^ 3, we take p* = 2d/(d - 2) and p = p*/(p* - 1). 
The choice of p* allows us to use the Poincare-Sobolev inequality (2.1) for \\v \\o, p *. For 
example, when d — 3, we take p* = 6 and p = 6/5. For d = 2, we may take any 
1 < p* < oo andp = p*/(p* — 1). 
Notice that by Taylor expansion, we can write b(u) — b{uh) as the finite sum 

n-l 

b(u) - b(u h ) = Ti b{k) ^ u ~ Uh ) k + -/ n) ^(u - u h )\ (3.8) 

k=l 

for some £ E Hq(£1). By Theorem 2.2 , the Assumptions (Al)-A(3) implies the a priori 
L°° bound of u (2.8), hence Hfc^^Hcoo is bounded for k = 1, 2, • • • , n — 1. On the 
other hand, we have ||&^(£)||o,oo < K by Assumption (A4). Therefore, by Minkowski 
inequality and (3.8) we obtain 

n / n \ 

||%) - b(u h )\\ , p ^ Ci W( u - ^)1o, P = Ci HT \\u - u h \\l kp , (3.9) 

fe=l \/c=l / 

where C\ := Ci({l,b,u,u, K,n) is a constant independent of h. Our range of k and 
choice of p ensures that 1 ^ kp ^ np ^ p* when d ^ 3, and 1 ^ kp ^ np < oo when 
d = 2. Therefore, by (2.1) we have 

\\u-u h \\o tkp ^C s (kp)\\V(u-u h )\\o,2, l^k^n. (3.10) 
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Using this together with (3.5), we have 



W u - Uh tk P < C k s (kp)\\V(u - u h ) 



1 0,2 

k=l k=l 



,Jfe=l 



= max C s (lp) — ^r— V(u - u h )\\ 0>2 

l<Z<n \1 - || V(tt - U h )\\o t2 J 

^C 2 (n,n,R)\\V(u- u h )\\ , 2 , (3.11) 

where R is the constant defined (3.5). Here we assumed R ^ 1 without loss of generality. 
Finally, using Sobolev inequality on \\v\\ p * and Combining (3.7), (3.9), and (3.11) now 
gives (3.6) with C L = dCzCsip*). ' ~ □ 

As a result of Theorem 3.2 and the standard interpolation error (see (4.1)), we obtain 
the following quasi-optimal error estimate: 

Corollary 3.5. Let the Assumptions (Al )-(A4) hold, and u and Uh be the solutions to (2.5) 
and (3.1), respectively. If in addition u G H S (Q) n Hq(Q) for some s > 1, then 

Wlu-UhH^Ch^WuW^, (3.12) 

where C is a constant independent ofh. 



4. Discrete L°° Error Estimates without Angle Conditions 

Once we obtain the quasi-optimal error estimate in Theorem 3.2, we can use it to ob- 
tain the error estimates in other norms, such as L 2 and L°° estimates. In this section, 
we first derive an L 2 -error estimate using the standard Aubin-Nitsche technique. This 
L 2 -error estimate is not only of its own interest, but also has several applications. For 
example, we can use it to obtain some L°° error estimates using certain inverse-type 
inequalities. The L°°-error estimate will then subsequently be useful in establishing a 
priori L°° bounds for the discrete solution, without any type of angle conditions. In 
other words, rather than first imposing restrictive angle conditions to get a discrete max- 
imum principle, and using this to control the nonlinearity to get a a priori error estimate, 
we essentially turn things around and use the error estimates to establish a priori L°° 
estimates of the discrete solution u^. 

We assume that the solution u to (2.5) satisfies the regularity u E H s (il) PI Hq(Q.) for 
some s > 1. Recall that given the quasi-uniform triangulation Th with the mesh-size h, 
there exists an interpolation operator Xh : H s (tt) fl H^tt) — > Vh such that the following 
standard interpolation error estimates hold (cf. [11, Chapter 3] or [39]): 

\\v-l h v\\ S(h2 ^Ch s - S0 \\v\\ s , 2 , s = 0,1, (4.1) 

and 

||^-X^||o,oo^C^ d / 2 ||i;|| Si2 , (4.2) 

for any v G H S (Q) f]H^(Q). In fact, (4.1) has been used to obtain (3.12) in Corollary 3.5. 

We now derive the L 2 error estimate for u — Uh by duality argument. To begin, let us 
introduce the following linear adjoint problem: Find w G Hq (fi) such that 

a(v , w) + (b'(u)v , w) = (u — Uh, v ), v G H (Q). (4.3) 



12 R. BANK, M. HOLST, R. SZYPOWSKI, AND Y. ZHU 

We assume that the linear problem (4.3) has the regularity 

IMIt,2 < C r \\u- U h \\ ,2 (4.4) 
for some t > 1. Then we have the following L 2 error estimate for Uh- 

Theorem 4.1 (L 2 Error Estimate). Let Assumptions (A1)-(A4) hold, and u G H s {yt) D 
Hq(Q) with s > 1 be the solution to (2.5), and Uh be the solution to (3.1). Suppose the 
dual problem (4.3) satisfies the regularity assumption (4.4) with t > 1. TTzen 

\\u - u h \\ 0> 2 < C (/i s+< - 2 + /i 2(s - 1} ) ||u||. j2 , (4.5) 

where C is independent of h. 

Proof. As in the proof of Theorem 3.4, recall the Taylor formula (3.8) for b{u) — b(v) : 

n-1 

b(u) - b(u h ) = j:M k \u){u - u h ) k + -r&< B >(0(« - u h )\ (4.6) 
^— ' k\ n\ 

k=l 

for some £ G ifg(fi). We have that ||6^(w)||o,oo is bounded for any k = 1, • • • , n — 
1 since |H|o,oo is bounded by Theorem 2.2. Moreover, ^^(Ollo.oo ^ K for some 
constant K > as stated in Assumption (A4). In particular, when n — 1, we may treat 
the problem as linear case, and the conclusion follows by the standard duality argument. 
Therefore, without loss of generality we assume n ^ 2 in the following proof. 
Taking v = u — Uh in (4.3) we obtain that 

\\ U ~ U h\\ll = a ( U ~ U h, W) + (b'(u)(u - U h ), W) 

= a(u -u h ,w - Wh) + {b'{u){u - u h ),w - w h ) 

+ a(u - u h ,w h ) + (b'{u){u - u h ),w h ). (4.7) 

Since Uh G Vh is the solution to the discrete semilinear problem (3.1), we have 

= a (u - u h , w h ) + (b(u) - b(u h ),w h ). 
By (4.6), the last two terms in (4.7) can be written as 

a(u-u h ,w h ) + (b'(u)(u-u h ),w h ) 

n-1 

= ~ E ^ b(k) («)(« - - ^y( &w (0(« - u h ) n , w h ). 

k=2 

Hence, by Holder inequality, the Poincare-Sobolev inequality (2.1) and the fact that 
||6^(w)||o,oo (k = l,"' ,n — l) and ||6^(£)l|o,oo are uniformly bounded, we obtain 

\\ u - u h\\l,2 ^ \\\ u - u h\H\w - W h \\\ + llfe'^Hcoollw - Wft||o,2||w ~ Wh\\o,2 

n-1 

- E uM k) ^ u - u ^ k ^) - -sb (n) mu - u h r, Wh ) 

kl nl 

k=2 

n 

< \\u - u h \\\\\\w - w h \\\ + \\u - u h \\^ p «\\w h \\ 0)P * /{p *_ k ) 

k=2 
n 

< \\u - u h \\\\\\w - w h \\\ + ^ i u - ^ll fe |kfello,p*/(p*-*)> ( 4 - 8 ) 

k=2 

where we choose p* = 6 for d = 3 and p* > n when d = 2. 
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To estimate the right hand side of (4.8), we take w h = P h w E Vh as the Galerkin 
projection of w, that is, Wh is the finite element solution to (4.3) on Vh- Then by the 
standard finite element approximation property for the linear equation (4.3), we have 

Ilk - w h \\\ < /i' _1 |Mk 2 ~ h^Wu - u h \\ G}2 , (4.9) 

where in the second inequality, we used the regularity assumption (4.4). Therefore, com- 
bining (4.9) with (3.12) in Corollary 3.5, we obtain that 

I it - u h \fw - w h \\\ < h s+t ~ 2 \\u\\ S:2 \\u - u h \\ 0t2 . (4.10) 

Now we turn to estimate the second term Ylk=2 \i u ~ ' u /i|| A: ||' u; /i||o,p*/(p*- fe ) m (4-8)- First 
of all, by the Poncare-Sobolev inequality (2.1), we have 

\\w h \W P */{p*-k) < C s [ -*-_ j} j l|V^||o )2 , k = 2, 3, • • • ,n. 
By (2.3) in Assumption (A4) on b, a similar argument as in Lemma 3.1 yields 

m \\^ w h\\l :2 < III III 2 = a(w h ,w h ) 

^ a(w h , w h ) + (b'(u)w h , w h ) = (u - u h , w h ) 

< ||« - W/i||o,2||W/i||o,2 < C s \\u - UhWozW^WhWo?, 

which gives us the estimate 

C 

||Vw ft || ,2 < — 1|« - Wh||o,2- 

m 

Therefore, we obtain that 

\\wh\\o,p*/(p*-k) ^ \\ u ~ u h \\ ,2, Vfc = 2, • • • ,n. 

Thus, we have 

n n 

J2\\U ~ U h l\ k \\w h \\ , p */( p *-k) < \\U - U h \\o,2j2 hkiS ~ 1) \\ U Wi^ (4 - U) 
k=2 k=2 

Combining inequalities (4.8), (4.10) and (4.1 1), the inequality (4.5) then follows. □ 

Remark 4.2. In case of full regularity, namely s = t = 2, then we have the optimal L 2 
error estimate. 

\\u - u h \\o, 2 < Ch 2 . (4.12) 

We now try to give a simple L°° error estimate. We first give the following general 
lemma. 

Lemma 4.3 (L°° — L 2 Lemma). Let u and Uh be the solutions to (2.5) and (3.1), respec- 
tively. Then we have 

||« - WfcHo.oo ;$ hif {\\u-v h \\o )00 + h~ d,2 \\u-Vh\\o,2) + hr d,2 \\u-u h \\o )2 . (4.13) 

Vh^Vh 

Proof We refer to [31, Remark 6.2.3] for a proof of this lemma. □ 
By the interpolation error estimates (4.1) and (4.2), we have 

inf {\\u - v h \\ 0)OO + hT d l 2 \\u - ^|| , 2 ) < h s - d l 2 \\u\\ s ^. (4.14) 

Vh&Vh 

The above discussion together with the L 2 error estimate (4.5) in Theorem 4.1 leads to 
the following L°° error as well as the discrete L°° bound: 
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Corollary 4.4 (L°° Error Estimate and A Priori L°° Bound). Let Assumptions (Al )-(A4) 
hold, and u G H S (Q) n Hq(Q) with s > 1 be the solution to (2.5), and Uh G Vh is the 
solution to (3.1). Suppose the dual problem (4.3) satisfies the regularity assumption (4.4) 
with t > 1. TTien we /zave the following L°° error estimate: 

\\u - UftHo.00 < (^" d/2 + ^ 2 + ^ 2(S " 1} ) ||«|| s> 2. (4.15) 
Moreover, if s > d/2 then for sufficiently small h we have 

Klkn < ^ (4-16) 
/or some constant B independent ofh. 

Proof. The inequality (4.15) follows by Lemma 4.3, (4.14) and (4.5). Notice that mm{s— 
d/2, s + t — 2, 2(s — 1)} > when s > d/2 and £ > 1, then a triangle inequality yields 
that 

IKHo,oo < IMIo.oo + \\U ~ U h \\ 0iOO . 

The inequality (4.16) then follows by Theorem 2.2 and (4.15) for sufficiently small h. 

□ 

Remark 4.5. In the case of second order linear elliptic PDE, the L°° error estimate 
in Corollary 4.4 has been discussed extensively in the literature; just a small sample 
includes [6, 16, 38, 35, 11, 36, 34, 32, 37]. We note that in the linear case, a better 
rate could be achieved by using more complicated techniques, as in the aforementioned 
works. 

5. Numerical Examples 

In this section we perform some numerical experiments in two and three dimensions 
to examine how the theory above is reflected in practice. In particular, we compute 
solutions to 

-Au + u p = f (5.1) 

subject to homogeneous boundary conditions, where / is chosen so that the solution is 
known and smooth. When d — 2, we choose p = 11 in (5.1); and when d = 3, we 
choose p as the critical exponent p — 5. The solutions are computed as a piecewise 
linear function on a series of meshes which are uniform refinements of one of two initial 
meshes; one mesh with good quality simplices and the other with large and small angles. 
The convergence profiles between the two sequences of solutions are compared in both 
H 1 and L°° norms. 

For the example in two dimensions, the initial meshes are shown in Figure 1 while 
the three-dimensional meshes are shown in Figure 2. For these meshes, we compute 
the triangle and tetrahedron shape metrics given by Knupp [27] which gives a number 
between and 1 quantifying the quality of the triangulation (with 1 given for isosceles 
simplices and for degenerate ones). These qualities are given in Table 1. 



Mesh 


Worst Quality 


Best Quality 


Good 2D 


0.495 


0.693 


Poor 2D 


0.213 


0.283 


Good 3D 


0.387 


0.632 


Poor 3D 


0.156 


0.417 



Table 1 . Shape Quality Metrics for the various meshes. 
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Figure 1 . Good and poor quality initial meshes used in the two dimen- 
sional examples. The poor quality mesh has a largest angle of approxi- 
mately 126° and smallest angle of approximately 8°. 




Figure 2. Good and poor quality initial meshes used in the three di- 
mensional examples. The poor quality mesh has a largest dihedral angle 
of approximately 115° and smallest dihedral angle of approximately 8°. 

Figure 3 and Figure 4 give the convergence results in both if 1 -norm and L°°-norm for 
the semilinear problems (5.1) in 2D and 3D, respectively. We observe that the quality of 
the mesh does not ruin the convergence rates in 2D, as long as we keep the mesh to be 
quasi-uniform. On the other hand, we do observe a little deterioration on the convergence 
rate in L°°-norm in the 3D example (see Figure 4) when we use a poor quality mesh. 
However, the errors in i^-norm in both 2D and 3D examples seem to be still quasi- 
optimal as predicted in Theorem 3.2. These results confirm our theoretical conclusions. 
We also observe that the convergence rate in L°°-norm is close to h 2 in both the 2D and 
3D examples, which indicates that the estimate in (4.15) is not optimal. Even though it 
is not optimal, we still got the a priori L°° bound of the discrete solution in (4. 16), which 
is important in the analysis of finite element approximation of nonlinear PDE. 

We also run a separate set of experiments in 2D in order to study (4.16) in Corol- 
lary 4.4. Specifically, starting with an initial good quality mesh, we compute the discrete 
solutions on successively worse meshes (produced using shortest edge bisections). Al- 
though we do not expect that the discrete solutions converge to the exact solution, we do 
hope that the L°° norm of the discrete solution remains bounded. Figure 5 shows the L°° 
norm of the discrete solution plotted against the size of the smallest angle in the mesh. 
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Figure 3. Convergence in H 1 and L°° norms for the two-dimensional 
example problem. 
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Figure 4. Convergence in H 1 and L°° norms for the three-dimensional 
example problem. 

This result confirms that the L°° norm of discrete solutions are uniformly bounded as 
predicted in Corollary 4.4. 



Smallest Angle 



Figure 5. Boundedness of the L°° norm of the discrete solution com- 
puted on successively worse meshes. 
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6. Conclusion 

In this article we considered a priori error estimates for a class of semilinear prob- 
lems with certain growth condition, which includes problems with critical and subcriti- 
cal polynomial nonlinearity in d space dimensions. Our motivation was that, while it is 
well-understood how mesh geometry impacts finite element interpolant quality (at least 
for d = 2 and d = 3). much more restrictive conditions on angles are needed to de- 
rive basic a priori quasi-optimal error estimates as well as a priori pointwise estimates 
for Galerkin approximations. These angle conditions, which are particularly difficult to 
satisfy in three dimensions in any type of unstructured or adaptive setting, are needed in 
order to gain pointwise control of the nonlinearity through discrete maximum/minimum 
principles. Our goal in the article was to show how to derive these types of a priori es- 
timates without requiring the discrete maximum/minimum principles, hence eliminating 
the need for restrictive angle conditions. 

To this end, in Section 2 we described a class of semilinear problems, and reviewed 
the a priori L°° bounds of the continuous solution through maximum/minimum prin- 
ciples using the De Giorgi iterative method (or Stampacchia truncation method). We 
then developed a basic quasi-optimal a priori error estimate for Galerkin approxima- 
tions in Section 3, where the nonlinearity was controlled by using only a local Lipschitz 
property rather than through pointwise control of the discrete solution. In this way, we 
avoid of using discrete maximum principle, which requires certain angle conditions. In 
particular, we showed that the local Lipschitz property in fact holds for nonlinearities 
satisfying certain growth condition, which includes the critical exponent cases. We then 
used some well-known results in finite element approximation theory in Section 4 to 
show that (under some minimal smoothness assumptions) that the a priori error estimate 
is itself enough to give L°° control the discrete solution, without the need for restrictive 
angle conditions that would be required to obtain a discrete maximum principle. 
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